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ABSTRACT 



PROBLEM TO BE SOLVED: To provide a document group label creation device and a method 
linguistically analyzing language attributes of document groups and automatically providing a label 
showing its contents without individually reading the contents of the document groups and to provide a 
recording medium. 

SOLUTION: This document group label creation device 1 allows a language analysis part 3 to analyze the 
language attributes of a plurality of text document groups comprising a plurality of text documents 
collected and stored in a text document group storage part 2, measures the analyzed language attribute 
information, selects language phenomena characteristic to the text document group based on the measured 
result, collates the selected characteristic language phenomena with a label creation rule of a label creation 
rule dictionary storage part 6 storing the label creation rule, and creates the label showing the contents of 
the text document group for the text document group by a label creation part 4. This constitution thus 
automatically provides the label showing the content by an expression easily understood by the user 
without individually reading the contents of the large amount of document groups. 
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Alerting Abstract US Al 

NOVELTY - A volume manager (112) manages electronic files and metadata related to the files of several 
volumes. The files are organized based on the metadata including automatically generated metadata and 
user defined metadata. 

USE - Electronic files management system for computer system, peer-to-peer system, personal digital 
assistant (PDA), MP3 player, cellular phone, electronic gaming system, for use in data recovery after 
disaster. 

ADVANTAGE - Facilitates categorization of information, and enables to perform event-driven actions for 
sharing and accessing of files, and backup and restoration operations of the electronic file. Enables to 
generate historical view showing modifications made on the file, and restoring of files and folder prior to 
the modification. 

DESCRIPTION OF DRAWINGS - DESCRIPTION OF DRAWING - The figure shows the block diagram 
of the server system with file management system. 

110 server 

112,121 volume managers 

120 terminal device 

130 wireless communication link 



Original Publication Data by AuthorityArgenlinaPublication No. Original Abstracts: An automated 
method for conditionally propagating nicladata instances among documents includes defining groups of 
metadata instances and includes assigning different propagation coefficients to each group. Each 

propagation coefficient assigned to a particular An automated method for conditionally propagating 

metadata instances among documents includes defining groups of metadata instances and includes 



assigning different propagation coefficients to each group. Each propagation coefficient assigned to a 
particular group is... Claims:What is claimed is:l. An automated method of conditionally propagating 
metadata among documents comprising:defining a pluraUty of groups of metadata instances;assigning a 
pluraUty of propagation coefficients to each said group, where each said propagation coefficient assigned... 
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Alerting Abstract US Al 

NOVELTY - The method involves extracting signatures from a corpus of documents and identifying 
similarities between the signamres. The related signatures are clustered hierarchically to generate concepts, 



where the concepts are clustered hierarchically to obtain concept hierarchy. The concepts that are organized 
in the concept hierarchy are labeled. An interface is created for the generated concept hierarchy. 
DESCRIPTION - An INDEPENDENT CLAIM is also included for a system for discovering a hierarchy of 
concepts from a corpus of documents. 

USE - Used for discovering a hierarchy of concepts from a corpus of documents. 

ADVANTAGE - The method automatically discovers a concept hierarchy that organizes concepts into 

multi levels of abstraction, from a corpus of documents. The signatures are frequency distributed to 

alleviate any inaccuracy in the similarity measure resulting from data sparsity and polysemy problems. The 

user-friendly interface allows user to efficiently retiieve relevant documents pertaining to a concept in the 

concept hierarchy and facilitates users in browsing and navigating information content of a corpus of 

documents. 

DESCRIPTION OF DRAWINGS - Tlie drawing shows a flow chart of a method for automatically 
identifying concepts and generating a concept hierarchy. 

Original Publication Data by Authority ArgenlinaPublication No. Original Abstracts:The invention is a 
method, system and computer program for automatically discovering concepts from a corpus of documents 
and automatically generating a labeled concept hierarchy. The method involves extraction of signatures 

from the corpus of documents. The similarity between signatures is computed method, system and 

computer program for automatically discovering concepts from a corpus of documents and automatically 
generating a labeled concept hierarchy. The method involves extraction of signatures from the corpus of 

documents. The similaiity between signatures is computed using a statistical measure. The method, 

system and computer program for automatically discovering concepts from a corpus of documents and 
automatically generating a labeled concept hierarchy. The method involves extraction of signatures from 
the corpus of documents. The similarity between signatures is computed using a statistical measure. The 

frequency distribution of signatures is clustered to generate concepts and concepts are arranged in a 

concept hierarchy. The concept hierarchy automatically generates query for a particular concept and 

retrieves relevant documents associated with the concept method, system and computer program for 

automatically discovering concepts from a corpus of documents and automatically generating a labeled 
concept hierarchy. The method involves extraction of signatures from the corpus of documents. The 
similarity between signatures is computed using a statistical measure. The frequency distribution of 
signatures is refined to alleviate any inaccuracy.. .Claims:claimed is:l. A method for automatically 
discovering a hierarchy of concepts from a corpus of documents, the concept hierarchy organizes concepts 
into multiple levels of absfraction, the method comprising:a. extracting signatures from the corpus... 
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Alerting Abstract US Bl 

NOVELTY - The method involves providing a tabular layout structure for a site map (102) in a site map 
form. A column layout structure for table of contents is provided in a table of contents form. The site map 
form and the table of contents form are provided in respective data fields for receiving data from a agent 
dynamically in response to a request from a user for displaying a site view e.g. site map and table of 
contents. 

DESCRIPTION - INDEPENDENT CLAIMS are also included for the foUowing: 



a. a system for displaying from a web site a site view selectively including a site map and a table of 

contents 

b. a program storage device readable by a machine tangibly embodying a program of instructions for 

performing a site view e.g. site map, displaying method. 

USE - Used for displaying a site view e.g. site map and table of content, from a website e.g. lotus domino 

web site. 

ADVANTAGE - The method effectively provides an improved site map and table of contents for a web 
page, and dynamically updates the web page site map and the table of contents to link directly with a 
volatile content. The method efficiently enables browser linking of site content to the site map and table of 
contents. 

DESCRIPTION OF DRAWINGS - The drawing shows a high level system diagram of a system for 
displaying a site view such as a site map and table of contents. 

101 Lotus domino web site 

102 Sitemap 

104 Table of contents 
106 Content database 
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NOVELTY - The dictionary including a subset of words contained in a document set and count of 
frequency occurrence of each word in the document set are generated. The set of documents are partitioned 
into multiple clusters for which the name and centroid in the dictionary space are generated. The cohesion 
and distinctness scores are generated for each cluster. A table including the name, cohesion score and 
distinctness score for each cluster is displayed. 

DESCRIPTION - INDEPENDENT CLAIMS are included for the following: 

c. Interactive classification and analysis system; and 

d. Computer program product for interactive classification and analysis. 

USE - For interactive classifying and analyzing textual data in helpdesk service. 
ADVANTAGE - Clustering of documents enables a user to determine the content of documents in the 
cluster without having to look at all of the documents. This saves the user's considerable time and 
ultimately reduces expenses. Enables identifying candidate helpdesk problem categories that are most 
amendable to automated solutions and hence improves the efficiency of the helpdesk operation. 
DESCRIPTION OF DRAWINGS - The figure shows a flow diagram of the interactive classification and 
analysis process. 



Alerting Abstract ... and ultimately reduces expenses. Enables identifying candidate helpdesk problem 
categories that are most amendable to automated solutions and hence improves the efficiency of the 
helpdesk operation... Original Publication Data by Authority ArgentinaPublication No. ...Original 
Abstracts:data. It is particularly useful in identification of helpdesk inquiry and problem categories 
amenable to automated fulfillment or solution. A dictionaiy is generated based on a frequency of 

occurrence of words a centroid, a cohesion score, and a distinctness score are generated for each cluster 

and displayed in a table. The documents contained in the clusters sorted based on their similarity to other 
documents in the cluster by calculating the distance of the document to the centroid of the cluster and 



the documents may be sorted in order of ascending or descending distance of the document to the centroid 

of the Claims: set of documents into a pluraUty of clusters, each cluster containing at least one 

document;generating a name for each cluster;generating a centroid of each cluster in the space of the 
dictionary;generating a cohesion score for each cluster;generating a distinctness score for each cluster; 
anddisplaying a table including the name of each cluster and the cohesion score and distinctness score for 
each cluster. 
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Original Titles: Method and apparatus for managing image categories in a digital camera to enhance 
performance of a high-capacity image storage media Alerting Abstract ...media has been determined that 
the image was deleted, added or recategorized. The list is automatically synchronized when the storage 
media is under control of a digital imaging device. Original Publication Data by 

Authority ArgentinaPubUcation No. ...Original Abstracts:method aspect of ihc picsent invention includes, 
storing images on the storage media, where the images include tags for calegori/iiii; ihc images. A 
category list is generated from the tags in the images and the category list is sioied on ihe storage media. 
A category within the category list is designated as a default category. The mellioti rnrlhcr includes 
accessing the image category list in order to display the images, wherein only the images belonging to 

the default category ai'e displayed, thereby speeding access lo, and display of, the (,"laims:media in a 

digital imaging device, comprising:a) storing images on the storage media, the images including lags for 
categorizing the images;b) generating an image category list from the lags in the images and storing the 
category list on the storage media, the image category list including a default category;c) accessing the 
image category list in order to display the images, wherein only the images belonging lo the default 
category are displayed, thereby speeding access to. ami (lispl;i\ ol'. ilie images: andd) synchroni/ing the 
image category list with the images on the storage iiic(li:i w licii ii li;is been determined that at least one 
image was deleted, added, or recategorized, wherein the syuchronizatiou iueludes,i) automatically 
synchronizing the image category Ust when the storage media is under control of the digital imaging 
device, andii) performing a subsequent synchronization of the image category... 
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Detailed Description: 
...OF THE DRAWINGS 

Figure 1 is a block diagram illustiating an online photosharing system that automatically stores and 
organizes digital files in accordance with a prefened embodiment of the present invention. 



Figure 2 is a flow diagram of the process for automatically generating 

names for the image files uploaded to the photosharing site in accordance with a preferred embodiment... 
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automatically selected as knowledge categories for the document. 

At a block 44, the knowledge categories selected, either automatically or by the user, are automatically 
assigned as metadata for the document. Assigning the knowledge categories as 
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Claims: ...of displaying comprises displaying using a file system presenter/editor whereby action items 
may be created, edited, opened, named, copied, or deleted displayed action items. 

33. A method for visually representing content and structure the file system an interactive system 

application containing at least one event-actuatable action item; 

displaying, in a hierarchical arrangement, a graphical representation of the at least one event-actuatable 
action item; and 

displaying an event identifier... 
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Title: Semi-automatic photo annotation strategies using event based clustering and clothing based 

person recognition 

Author Bongwon Sub; Bedeison, B.B. 

Author Affiliation: Univ. of Maryland, College Park, USA 

Journal: Interacting witli Computers vol.19, no.4 p. 524-44 

Publisher: Elsevier Science Ltd. , 

Publication Date: 2007 Country of Publication: UK 

CODEN: INTCEE ISSN: 0953-5438 

Material Identity Number: ES42-2007-001 
Item Identifier (DOI): 10. [0i6,'j.inicom.2007.02.002 
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Language: English Document Type: Journal Paper (JP) 
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Abstract: Managing a large number of digital photos is a challenging task for casual users. Personal photos 
often dont have rich metadata, or additional information associated with them. However, available 
metadata can play a crucial role in managing photos. Labeling the semantic content of photos (i.e., 
annotating them), can increase the amount of metadata and facilitate efficient management. However, 
manual annotation is tedious and labor intensive while automatic metadata extraction techniques often 
generate inaccurate and irrelevant results. This paper describes a semi-automatic annotation strategy that 
takes advantage of human and computer strengths. The semi-automatic approach enables users to 
efficiently update automatically obtained metadata interactively and incrementally. Even though 
automatically identified metadata are compromised with inaccurate recognition errors, the process of 
correcting inaccurate information can be faster and easier than manually adding new metadata from scratch. 
In this paper, we introduce two photo clustering algorithms for generating meaningful photo groups: (1) 
Hierarchical event clustering; and (2) Clothing based person recognition, which assumes that people who 
wear similar clothing and appear in photos taken in one day are very likely to be tlie same person. To 
explore our semi-automatic strategies, we designed and implemented a prototype called SAPHARI (Semi- 
Automatic PHoto Annotation and Recognition Interface). Tlie prototype pro\'ides an annotation framework 
which focuses on making bulk annotations on automatically identified photo groups. Tlie prototype 
automatically creates photo clusters based on events, people, and file metadata so tliat users can easily 
bulk annotation photos. We performed a series of user studies to investigate tlie effectiveness and usability 
of the semi-automatic annotation techniques when applied to personal photo collections. The results show 
that users were able to make annotations significantly faster with event clustering using SAPHARI. We 
also found that users clearly preferred the semi-automatic approaches. [All rights reserved Elsevier]. ( 42 
Refs) 
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Automatic taxonomy generation: Issues and possibilities 
Fuzzy sets and systems : Istanbul, 30 June - 2 July 2003 
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Document Type: P (Serial); C (Conference Proceedings) ; A (Analytic) 
Country of Publication: Germany 
Language: English 

Automatic taxonomy generation deals with organizing text 
documents in terms of an unknown labeled hierarchy. The main issues 
here are (i) how to identify documents that have similar content, (ii) how 
to discover the hierarchical structure of the topics and subtopics, and 
(iii) how to find appropriate labels for each of the topics and subtopics. 
In this paper, we review several approaches to automatic taxonomy 
generation to provide an insight into the issues involved. We also describe 
how fuzzy hierarchies can overcome some of the problems associated with 
traditional crisp taxonomies. 

Fnglish Descriptors: Text; Taxonomy; Automatic generation: 
Hierarclii/cd str ucture; Label; Fuzzy logic 

French Descriptors: Texte; Systematique; Generation automatique; Structure 
hierarchisee; Etiquette; Logique floue 
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Author AffiUation: Mciosoft Res., Redmond, WA, USA 

Conference Title: ICICS-PCM 2003. Proceedings of the 2003 Joint Conference of the Fourth International 
Conference on Information, Communications and Signal Processing and Fourth Pacific -Rim Conference on 
Multimedia (IEEE Cat. NO.03EX758) Part Vol.1 p. 6-10 Vol.1 
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Publication Date: 2003 Country of Publication: USA xlvii+1986 pp. 
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Conference Title: ICICS-PCM 2003. Proceedings of the 2003 Joint Conference of the Fourth International 
Conference on Information, Communications and Signal Processing and Fourth Pacific -Rim Conference on 
Multimedia 

Conference Sponsor: Microsoft Res. Asia; Singapore Exhibition and Convention Bureau 
Conference Date: 15-18 Dec. 2003 Conference Location: Singapore 

Language: English Document Type: Conference Paper (PA) 
Treatment: Practical (P); Experimental (X) 

Abstract: This paper presents photo table of contents (PhotoTOC), a system that helps users find digital 
photographs in their own collection of photographs. PhotoTOC is a browsing user interface that uses an 
overview+detail design. The detail view is a temporally ordered list of all of the user's photographs. The 
overview of the user's collection is automatically generated by an image clustering algorithm, which 
clusters on the creation time and the color of the photographs. PhotoTOC was tested on users' own 
photographs against three other browsers. Searching for images with PhotoTOC was subjectively rated 
easier than all of the other browsers. This result shows that automatic organization of personal 
photographs faciUtates efficient and satisfying search. ( 13 Refs) 
Subfile: B C 
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Class Codes: B6135 (Optical, image and video signal processing); C5260B (Computer vision and image 
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Author Fan, X.; Liu, Q.; Ng, P. 

Author AffiUation: Teleran Technol., East Hanover, NJ, USA 
Journal: Journal of Systems Integration vol.9, no.3 p. 223-62 
Publisher: Kluwer Academic Publishers , 

Publication Date: Oct. 1999 Country of Publication: Netherlands 

CODEN: JSINE4 ISSN: 0925-4676 
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Material Identity Number: 0661-1999-004 
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Abstract: TEXPROS (Text Processing System) is an automatic document processing system which 
supports text-based information representation and manipulation, conveying meanings from stored 
information within office document texts. A dual modeling approach is employed to describe office 
documents and support document search and retrieval. Tlie frame templates for representing document 
classes arc organized to form a document type hierarchy. Based on its document type, the synopsis of a 
document is extracted to form its conesponding frame instance. According to the user predefined criteria, 
these frame instances are stored in different folders, which are organized as a folder organization (i.e., 
repository of frame instances associated with their documents). The concept of linking folders establishes 
filing paths for automatically filing documents in the folder organization. By integrating document type 
hierarchy and folder organization, the dual modeling approach provides efficient frame instance access by 
limiting the searches to those frame instances of a document type within those folders which appear to be 
the most similar to the corresponding queries. This paper presents an agent-based document filing system 
using folder organization. A storage architecture is presented to incorporate the document type hierarchy, 
folder organization and original document storage into a three-level storage system. This folder 
organization supports effective filing strategy and allows rapid fi-ame instance searches by confining the 
search to the actual predicate-driven retrieval method. ( 65 Refs) 
Subfile: C 

Descriptors: information retrieval; office automation; software agents; text analysis 
Identifiers: automated document filing system; TEXPROS; text processing system; automatic document 
processing system; text-based information representation : text-based infomiation manipulation; stored 
information; office document texts; dual modeling approach: document seaich: document retiieval; frame 
templates; document type hierarchy; document synopsis; user predefined criteria; folders; filing paths; 
agent-based document filing system; queries; storage architecture; three-level storage system; predicate- 
driven retrieval method 
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Conference Sponsor: IEEE; IEEE Neural Networks Soc. (NNS); Int. Neural Network Soc 
Conference Date: 12-17 May 2002 Conference 1 ocalion: Honolulu, HI, USA 

Language: English Document Type: Conference Paper (PA) 
Treatment: Practical (P) 

Abstract: In tliis paper, we investigate the use of self-organising maps (SOMs) for document clustering. 
Previous methods using SOMs to cluster documents have used 2D maps. This paper presents a hierarchical 
and growing method using a series of ID maps instead. Using this type of SOM is an efficient method for 
clustering documents and browsing them in a dynamically generated tree of topics. These topics are 
automatically discovered for each cluster, based on the set of documents in a particular cluster. We 
demonstrate the efficiency of the method using different sets of real-world Web documents. ( 24 Refs) 
Subfile: C 

Descriptors: classification; document handling; information resources; pattem clustering; self-organising 
feature maps; tree data structures 

Identifiers: self -organising maps; tree view-based hierarchical document clustering; growing method; ID 
map series; document browsing; dynamically generated topic tree; automatic topic discovery; World 
Wide Web documents 

Class Codes: C7240 (Information analysis and indexing); C6130D (Document processing techniques); 
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Abstract: The paper inlrodiices a new documcnl list viewer and browser, which are convenient tools that 
apply natural language processing to support documcnl ulili/alion. I'he document ulili/alion process 
involves a cycle in which required documents are first retrieved, then analyzed by conlenl. I'he user can 
then formulate ideas and create another document. Botli tools help tlic user perform essential steps in this 
cycle. The document list viewer provides a well orgaiii/eil list oi clocuiuculs by clustering documents 
based on content similarities, and by attaching precise summaries ^cuciaiccl automatically, the user can 
quickly find a required document even from a long list of retrieved documents. With the automatic text 
summarization function, the document browser allows users to skim through a long document on a 
computer display, then interactively creates customized summaries. The paper describes the features of 
these new tools and discusses the requirements for document selection and browsing. ( 6 Refs) 
Subfile: C 
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Document Type: Patent 
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Technologies and Engineering; Aerospace & High Technology 

Abstract: 

A data processor for managing a multilevel application wherein the data processor includes a display 
controller that creates a three-level menu window and a data window. Fhc menu window includes at least 
two levels that each incorporate select commands associalcd w itii discrete subjects wherein menu 
commands are concurrently displayed on screen lo prox idc historical access information. Menu commands 
are converted into display presentations where each level defines a greater degree of information detail on a 
given subject. The data display window for these presentations is further divided into windows for text and 
graphics. The foregoing arrangement has been found to be exceptionally effective at providing 
educational or tutorial information access in a efficient manner. 

Descriptors: Menus; Texts; Dynamical systems; Microprocessors; Dynamics; Multilevel; Education; 
Positioning; Screens 
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